
    Genomic Transition to Pathogenicity in Chytrid Fungi

    Understanding the molecular mechanisms of pathogen emergence is central to mitigating the impacts of novel infectious disease agents. The chytrid fungus Batrachochytrium dendrobatidis (Bd) is an emerging pathogen of amphibians that has been implicated in amphibian declines worldwide. Bd is the only member of its clade known to attack vertebrates. However, little is known about the molecular determinants of, or evolutionary transition to, pathogenicity in Bd. Here we sequence the genome of Bd's closest known relative, the non-pathogenic chytrid Homolaphlyctis polyrhiza (Hp). We first describe the genome of Hp, which is comparable to other chytrid genomes in size and number of predicted proteins. We then compare the genomes of Hp, Bd, and 19 additional fungal genomes to identify unique or recent evolutionary elements in the Bd genome. We identified 1,974 Bd-specific genes, a gene set that is enriched for protease, lipase, and microbial effector Gene Ontology terms. We describe significant lineage-specific expansions in three Bd protease families (metallo-, serine-type, and aspartyl proteases). We show that these protease gene family expansions occurred after the divergence of Bd and Hp from their common ancestor and thus are localized to the Bd branch. Finally, we demonstrate that the timing of the protease gene family expansions predates the emergence of Bd as a globally important amphibian pathogen.

    Efficient representation of uncertainty in multiple sequence alignments using directed acyclic graphs

    Background: A standard procedure in many areas of bioinformatics is to use a single multiple sequence alignment (MSA) as the basis for various types of analysis. However, downstream results may be highly sensitive to the alignment used, and neglecting the uncertainty in the alignment can lead to significant bias in the resulting inference. In recent years, a number of approaches have been developed for probabilistic sampling of alignments, rather than simply generating a single optimum. However, this type of probabilistic information is currently not widely used in the context of downstream inference, since most existing algorithms are set up to make use of a single alignment. Results: In this work we present a framework for representing a set of sampled alignments as a directed acyclic graph (DAG) whose nodes are alignment columns; each path through this DAG then represents a valid alignment. Since the probabilities of individual columns can be estimated from empirical frequencies, this approach enables sample-based estimation of posterior alignment probabilities. Moreover, due to conditional independencies between columns, the graph structure encodes a much larger set of alignments than the original set of sampled MSAs, such that the effective sample size is greatly increased. Conclusions: The alignment DAG provides a natural way to represent a distribution in the space of MSAs, and allows for existing algorithms to be efficiently scaled up to operate on large sets of alignments. As an example, we show how this can be used to compute marginal probabilities for tree topologies, averaging over a very large number of MSAs. This framework can also be used to generate a statistically meaningful summary alignment; example applications show that this summary alignment is consistently more accurate than the majority of the alignment samples, leading to improvements in downstream tree inference.
Implementations of the methods described in this article are available at http://statalign.github.io/WeaveAlign
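
The idea of a DAG whose nodes are alignment columns can be illustrated with a minimal sketch. This is not the WeaveAlign implementation; the column encoding (a tuple holding, for each sequence, a residue index or `None` for a gap), the `START`/`END` sentinels, and the function names `build_alignment_dag` and `count_paths` are all assumptions made for illustration. Two toy sampled alignments that share one column are merged, and counting the paths shows that the graph encodes more alignments than were sampled.

```python
from collections import Counter, defaultdict
from functools import lru_cache

# Hypothetical sentinel nodes marking the two ends of every alignment.
START, END = "START", "END"


def build_alignment_dag(samples):
    """Merge sampled alignments into one graph.

    Each alignment is a list of columns; each column is a tuple giving,
    per sequence, the index of the aligned residue or None for a gap
    (an assumed encoding). Returns the edge map and per-column counts.
    """
    edges = defaultdict(set)
    counts = Counter()
    for alignment in samples:
        counts.update(alignment)
        path = [START, *alignment, END]
        for left, right in zip(path, path[1:]):
            edges[left].add(right)
    return edges, counts


def count_paths(edges):
    """Count distinct START-to-END paths, i.e. alignments the DAG encodes."""

    @lru_cache(maxsize=None)
    def paths_from(node):
        if node == END:
            return 1
        return sum(paths_from(nxt) for nxt in edges[node])

    return paths_from(START)


# Two toy samples for two sequences of three residues each.
sample_a = [(0, None), (None, 0), (1, 1), (2, None), (None, 2)]
sample_b = [(0, 0), (1, 1), (2, 2)]
samples = [sample_a, sample_b]

edges, counts = build_alignment_dag(samples)
n_paths = count_paths(edges)

# Empirical frequency of each column across the samples.
column_freq = {col: n / len(samples) for col, n in counts.items()}
```

Here the shared column `(1, 1)` lets either sampled prefix be combined with either sampled suffix, so the two samples yield four paths. The `column_freq` values are the empirical column frequencies that the abstract mentions as the basis for estimating column probabilities.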

    LRR-RLK family from two Citrus species: Genome-wide identification and evolutionary aspects

    Background: Leucine-rich repeat receptor-like kinases (LRR-RLKs) represent the largest subfamily of plant RLKs. The functions of most LRR-RLKs remain unknown, and the few that have been experimentally characterized have important roles in growth and development as well as in defense responses. Although RLK subfamilies have been previously studied in many plants, no comprehensive study has been performed on this gene family in Citrus species, which have high economic importance and are frequent targets for emerging pathogens. In this study, we performed in silico analysis to identify and classify LRR-RLK homologues in the predicted proteomes of Citrus clementina (clementine) and Citrus sinensis (sweet orange). In addition, we used large-scale phylogenetic approaches to elucidate the evolutionary relationships of the LRR-RLKs and further narrowed the analysis to the LRR-XII group, which contains several previously described cell surface immune receptors. Results: We built integrative protein signature databases for Citrus clementina and Citrus sinensis using all predicted protein sequences obtained from whole genomes. A total of 300 and 297 proteins were identified as LRR-RLKs in C. clementina and C. sinensis, respectively. Maximum-likelihood phylogenetic trees were estimated using Arabidopsis LRR-RLK as a template and they allowed us to classify Citrus LRR-RLKs into 16 groups. The LRR-XII group showed a remarkable expansion, containing approximately 150 paralogs encoded in each Citrus genome. Phylogenetic analysis also demonstrated the existence of two distinct LRR-XII clades, each one constituted mainly by RD and non-RD kinases. We identified 68 orthologous pairs from the C. clementina and C. sinensis LRR-XII genes. In addition, among the paralogs, we identified a subset of 78 and 62 clustered genes probably derived from tandem duplication events in the genomes of C. clementina and C. sinensis, respectively.
Conclusions: This work provided the first comprehensive evolutionary analysis of the LRR-RLKs in Citrus. A large expansion of LRR-XII in Citrus genomes suggests that it might play a key role in adaptive responses in host-pathogen co-evolution, related to the perennial life cycle and domestication of the citrus crop species.

    Persistence of single species of symbionts across multiple closely related host species

    Some symbiont species are highly host-specific, inhabiting only one or a very few host species, and typically have limited dispersal abilities. When they do occur on multiple host species, populations of such symbionts are expected to become genetically structured across these different host species, and this may eventually lead to new symbiont species over evolutionary timescales. However, a low number of dispersal events of symbionts between host species across time might be enough to prevent population structure and species divergence. Overall, processes of evolutionary divergence and the species status of most putative multi-host symbiont systems are yet to be investigated. Here, we used DNA metabarcoding data of 6,023 feather mites (a total of 2,225 OTU representative sequences) from 147 infracommunities (i.e., the assemblage consisting of all mites of different species collected from the same bird host individual) to investigate patterns of population genetic structure and species status of three different putative multi-host feather mite species, Proctophyllodes macedo Vitzthum, 1922, Proctophyllodes motacillae Gaud, 1953, and Trouessartia jedliczkai (Zimmerman, 1894), each of which inhabits a variable number of different closely related wagtail host species (genus Motacilla). We show that mite populations from different host species represent a single species. This pattern was found in all the mite species, suggesting that each of these species is a multi-host species in which dispersal of mites among host species prevents species divergence. Also, we found evidence of limited evolutionary divergence manifested by a low but significant level of population genetic structure among symbiont populations inhabiting different host species. Our study agrees with previous studies showing higher than expected colonization opportunities in host-specific symbionts.
Indeed, our results support that these dispersal events would allow the persistence of multi-host species even in symbionts with limited dispersal capabilities, though additional factors such as the geographical structure of some bird populations may also play a role. This work was supported by MINECO grants CGL2011-24466 to RJ and CGL2015-69650-P to RJ and DS.

    Programmed DNA elimination of germline development genes in songbirds

    In some eukaryotes, germline and somatic genomes differ dramatically in their composition. Here we characterise a major germline–soma dissimilarity caused by a germline-restricted chromosome (GRC) in songbirds. We show that the zebra finch GRC contains >115 genes paralogous to single-copy genes on 18 autosomes and the Z chromosome, and is enriched in genes involved in female gonad development. Many genes are likely functional, evidenced by expression in testes and ovaries at the RNA and protein level. Using comparative genomics, we show that genes have been added to the GRC over millions of years of evolution, with embryonic development genes bicc1 and trim71 dating to the ancestor of songbirds and dozens of other genes added very recently. The somatic elimination of this evolutionarily dynamic chromosome in songbirds implies a unique mechanism to minimise genetic conflict between germline and soma, relevant to antagonistic pleiotropy, an evolutionary process underlying ageing and sexual traits.

    From community approaches to single-cell genomics: the discovery of ubiquitous hyperhalophilic Bacteroidetes generalists

    The microbiota of multi-pond solar salterns around the world has been analyzed using a variety of culture-dependent and molecular techniques. However, studies addressing the dynamic nature of these systems are very scarce. Here we have characterized the temporal variation during 1 year of the microbiota of five ponds with increasing salinity (from 18% to >40%), by means of CARD-FISH and DGGE. Microbial community structure was statistically correlated with several environmental parameters, including ionic composition and meteorological factors, indicating that the microbial community was dynamic as specific phylotypes appeared only at certain times of the year. In addition to total salinity, microbial composition was strongly influenced by temperature and specific ionic composition. Remarkably, DGGE analyses unveiled the presence of most phylotypes previously detected in hypersaline systems using metagenomics and other molecular techniques, such as the very abundant Haloquadratum and Salinibacter representatives or the recently described low GC Actinobacteria and Nanohaloarchaeota. In addition, an uncultured group of Bacteroidetes was present along the whole range of salinity. Database searches indicated a previously unrecognized widespread distribution of this phylotype. Single-cell genome analysis of five members of this group suggested a set of metabolic characteristics that could provide competitive advantages in hypersaline environments, such as polymer degradation capabilities, the presence of retinal-binding light-activated proton pumps and arsenate reduction potential. 
In addition, the fairly high metagenomic fragment recruitment obtained for these single cells in both the intermediate and hypersaline ponds further confirms the DGGE data and points to the generalist lifestyle of this new Bacteroidetes group. This work was supported by the projects CGL2012-39627-C03-01 and 02 of the Spanish Ministry of Economy and Competitiveness, which were also co-financed with FEDER support from the European Union. TG group research is funded in part by a grant from the Spanish Ministry of Economy and Competitiveness (BIO2012-37161), a grant from the Qatar National Research Fund (NPRP 5-298-3-086) and a grant from the European Research Council under the European Union’s Seventh Framework Programme (FP/2007-2013)/ERC (grant agreement no. ERC-2012-StG-310325).

    Dissemination of Cephalosporin Resistance Genes between Escherichia coli Strains from Farm Animals and Humans by Specific Plasmid Lineages

    Third-generation cephalosporins are a class of β-lactam antibiotics that are often used for the treatment of human infections caused by Gram-negative bacteria, especially Escherichia coli. Worryingly, the incidence of human infections caused by third-generation cephalosporin-resistant E. coli is increasing worldwide. Recent studies have suggested that these E. coli strains, and their antibiotic resistance genes, can spread from food-producing animals, via the food-chain, to humans. However, these studies used traditional typing methods, which may not have provided sufficient resolution to reliably assess the relatedness of these strains. We therefore used whole-genome sequencing (WGS) to study the relatedness of cephalosporin-resistant E. coli from humans, chicken meat, poultry and pigs. One strain collection included pairs of human and poultry-associated strains that had previously been considered to be identical based on Multi-Locus Sequence Typing, plasmid typing and antibiotic resistance gene sequencing. The second collection included isolates from farmers and their pigs. WGS analysis revealed considerable heterogeneity between human and poultry-associated isolates. The most closely related pairs of strains from both sources carried 1263 Single-Nucleotide Polymorphisms (SNPs) per Mbp core genome. In contrast, epidemiologically linked strains from humans and pigs differed by only 1.8 SNPs per Mbp core genome. WGS-based plasmid reconstructions revealed three distinct plasmid lineages (IncI1- and IncK-type) that carried cephalosporin resistance genes of the Extended-Spectrum Beta-Lactamase (ESBL)- and AmpC-types. The plasmid backbones within each lineage were virtually identical and were shared by genetically unrelated human and animal isolates. Plasmid reconstructions from short-read sequencing data were validated by long-read DNA sequencing for two strains. 
Our findings did not provide evidence for recent clonal transmission of cephalosporin-resistant E. coli strains from poultry to humans, as has been suggested based on traditional, low-resolution typing methods. Instead, our data suggest that cephalosporin resistance genes are mainly disseminated in animals and humans via distinct plasmids.